Readability Metrics for Machine Translation in Dutch: Google vs. Azure & IBM
نویسندگان
چکیده
This paper introduces a novel method to predict when Google translation is better than other machine translations (MT) in Dutch. Instead of considering fidelity, this approach considers fluency and readability indicators for ranked best. research explores an alternative the field quality estimation. The contributes by publishing dataset with sentences from English Dutch, human-made classifications on best-worst scale. Logistic regression shows correlation between T-Scan output, such as measurements like lemma frequencies, was Azure IBM. last part results section prediction possibilities. First logistic second generated automated learning model. Respectively, they have accuracy 0.59 0.61.
منابع مشابه
IBM statistical machine translation for spoken languages
We discuss performance enhancing techniques we have developed for the IWSLT 2005 Evaluation Campaign: (i) a phrase acquisition technique which expands the phrase boundaries to include target words aligned to null source words in a principled manner, and (ii) a system combination technique which selects the minimum cost translation output out of many translation outputs of the same input segment...
متن کاملAdvances in Machine Translation Research in IBM
IBM is engaged in advanced research and development projects on various aspects of machine translation, between several language pairs. The activities reported on hero are all parts of a rather large-scale, international effort, following Michael McCord’s LMT approach. The paper focuses on seven selected topics: recent enhancements made in the Slot Grammar formalism and the specific analysis co...
متن کاملStatistical Machine Translation with Readability Constraints
This paper presents experiments with document-level machine translation with readability constraints. We describe the task of producing simplified translations from a given source with the aim to optimize machine translation for specific target users such as language learners. In our approach, we introduce global features that are known to affect readability into a documentlevel SMT decoding fr...
متن کاملEvaluation Metrics for Machine Reading Comprehension: Prerequisite Skills and Readability
Knowing the quality of reading comprehension (RC) datasets is important for the development of natural-language understanding systems. In this study, two classes of metrics were adopted for evaluating RC datasets: prerequisite skills and readability. We applied these classes to six existing datasets, including MCTest and SQuAD, and highlighted the characteristics of the datasets according to ea...
متن کاملNL-Translex: Machine Translation for Dutch
NL-Translex is an MLIS-project which is funded jointly by the European Commission, the Dutch Language Union, the Ducth Ministry of Education, Culture and Science, the Dutch Ministry of Economic Affairs and the Flemish Institute for the Promotion of Scientific and Technological Research in Industry. The aim of this project is to develop Machine Translation components that will handle unrestricte...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2023
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app13074444